Computer and Modernization ›› 2011, Vol. 1 ›› Issue (3): 127-130.doi: 10.3969/j.issn.1006-2475.2011.03.036

• 应用与开发 • Previous Articles     Next Articles

Construction of Common Document Parser Framework for Lucene

LI Hao   

  1. School of Computer Science, South China Normal University, Guangzhou 510631, China
  • Received:2010-11-09 Revised:1900-01-01 Online:2011-03-18 Published:2011-03-18

Abstract: Lucene is an excellent technology frame of full-text retrieval engine of open source code. Firstly, Lucene, an advance full-text retrieval engine is introduced, system structure, running logic, and extend based on Lucene are analyzed in detail. Then for the Lucene document analysis in different types of deficiencies, a common document parser framework and practical examples are given.

Key words: full-text search, Lucene, open source code, document parser

CLC Number: